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(54) Method and apparatus for software distribution 



(57) A software distribution system for software bun- 
dles comprising a plurality of packages, at least two of 
which are required by a different set of nodes on a net- 
work. Rather than send ail of the packages constituting 



the software bundle to each node, only the packages in 
the software bundle which are required by nodes below 
a certain link in the network are sent along that link, 
whereby to minimize the bandwidth required for trans- 
mitting the software bundle. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to soltware dis- 
tribution and in particular to a method of software distri- 
bution across a network which minimizes the overall 
bandwidth used to transmit the software. 
[0002] It is well known to transmit software packages 
across a network from a server to one or more clients 
which require the package. Often, newer versions of a 
package will be released and will need distributing to 
clients running the older version of the software. 
[0003] The packages which are being distributed ac- 
cording to the invention are sets of files, optionally with 
accompanying scripts. The scripts are run at installation 
time, and might for example be pre-install scripts run 
before files are installed or post-install scripts run after- 
ward. 

[0004] In certain circumstances, it is not imperative 
that all systems running the older software be upgraded 
or it is not possible for a server to have a record of all 
the systems running the software. In these circumstanc- 
es, clients running the software could periodically poll 
the sen/er which might or might not have a new version 
of the software, and request that the software be sent 
to the client if the server informs the client that it has a 
newer version. This approach is taken by many com- 
mercial software distributors who make upgrades of 
their products available over the internet. 
[0005] In many situations, the above solution to soft- 
ware distribution is not acceptable. For example, if the 
software in question has never been installed on a client 
system, the client system has no way of knowing it 
should request the new software. Furthermore, software 
upgrades often need installing immediately to maintain 
overall system integrity. In view of these situations, and 
other similar situations, it has become necessary to pro- 
vide distribution software that actively distributes soft- 
ware packages from a server to one or more clients, 
rather than waiting for the clients to request the soft- 
ware. An example of such a system is Platinum Tech- 
nology, Inc.'s AutoXfer product. 
[0006] Software distribution packages are commonly 
hierarchical, in that files in the software distribution 
package can be logically grouped into smaller packages 
which might be required on certain clients but not on oth- 
ers. Such packages, containing a set of files, are re- 
ferred to as simple packages, and might for example 
consist of the files constituting a single application. A 
software package might also contain one or more com- 
posite packages which do not contain actual files, but 
instead contain references to two or more simple or 
composite packages which constitute the composite 
package. For example, a composite package might rep- 
resent a suite of applications, the simple packages for 
which are referenced by the composite package. In or- 
der to send a composite package between nodes in a 



network, the packages referred to in the composite 

package need to be sent, along with the composite 
package. In addition, a composite package may also re- 
quire that its referenced packages be installed in the or- 
5 der specified by its definition, even if it has no contingent 
scripts. 

[0007] During repackaging there are two kinds of 
composite packages to consider. An open package has 
no constraints upon the order or processing of its con- 

10 tents, but is simply an aggregation of other packages. 
The contents of an open package can be freely manip- 
ulated without compromising the package itself, and its 
presence is implied by the presence of its contents. A 
closed package has constraints that require that its con- 

15 tents be installed in the context of the package (instal- 
lation) itself, and its presence must be explicitly indicat- 
ed. 

[0008] Often, target nodes on the network will already 
have one or more of the simple packages referred to in 
20 a composite package installed before the composite 
package is sent. If the composite package is an open 
package, it is highly wasteful of bandwidth to send sim- 
ple packages already installed to a target node, as the 
simple packages are not required to complete installe- 
rs tion of the composite package. 

[0009] Standards have been implemented for soft- 
ware distribution formats, such as the Posix 1387.2 
standard. It uses four prinoary objects: Fileset, Subprod- 
uct. Product, and Bundle. This standard essentially 
30 specifies installation format in a Unix environment. It 
calls upon other Posix standards for definitions and re- 
quirements used in the standard. 
[0010] Net News uses a filtering mechanism in order 
to send news articles to network machines, such that 
35 only what is requested and required be sent. There is 
no hierarchical structure to the articles, however, so the 
same problems do not arise as with software distribu- 
tion. 

40 STATEMENT OF INVENTION 

[0011] The present invention provides a method and 
apparatus for transmitting a software package made up 
of other smaller packages across a network in a tree 

45 distribution pattern In which the transmitting node is the 
root, and clients requiring some part ot the package are 
certain nodes in the tree. According to the invention, in- 
formation is provided to nodes at branches in Ihe tree 
concerning which parts of the composite package are 

so required by the clients to which that node is supplying 
the package, The branch node transmits only parts of 
the package to each of its child nodes which are required 
by nodes below and including that child node in the tree. 
Each package constituting the software package can be 

55 unpacked once any packages upon which it is contin- 
gent have been unpacked. 

[0012] The invention further provides a method and 
apparatus for transmitting a software package made up 
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of other smaller packages across a network by sending 
each of the smaller packages individually in a tree dis- 
tribution pattern only to the nodes on which the smaller 
package in question is required, in which the transmit- 
ting node is the root, and clients requiring the smaller 
package are certain nodes In the tree. Each of the small- 
er packages constituting the software package can be 
unpacked once any packages upon which it is contin- 
gent have been unpacked. 

[0013] In one aspect of the invention, the packages 
forming the software package are sent as a single entity, 
and repackaging is carried out at branch nodes in the 
tree when all nodes further down the tree along a par- 
ticular branch do not require all the packages forming 
the package received at the branch node to be sent. A 
repackaged software package passed down each of the 
branches then contains only the packages which are re- 
quired by nodes below that branch. 
[0014] In another aspect of the invention, different 
packages forming a software package are multicasted 
independently to target nodes at which they are re- 
quired, and then each package is unpacked once all the 
packages required at a target node have been received. 
[0015] These and other objects of the invention will 
bo apparent from the remaining portion of this specifi- 
cation. 



BRIEF D ESCRIPTION OF THE DRAWIMftg 

[001 6] FIGURE 1 shows an example of a network on 
which the invention might be implemented. 
[0017] FIGURE 2 shows an example of a spanning 
tree across the network shown in Figure 1. 
[0018] FIGURE 3 shows the structure of a composite 
package according to a first example of the present in- 
vention. 

[001 9] FIGURE 4A shows the structure of a compos- 
ite package according to an example of the present in- 
vention. 

[0020] FIGURE 4B shows a first example of the dis- 
tribution of the composite package shown in Figure 3A. 
[0021] FIGURE 4C shows a second example of the 
distribution of the composite package shown in Figure 
3 A. 

[0022] FIGURE 5 shows an array generated by the 
first embodiment in establishing which packages are to 
be sent along each link from node Q in Figure 4C 
[0023] FIGURES 6A and 6B show different spanning 
trees as implemented in a second embodiment of the 
invention. 

DETAILED DESCRIPTION 

[0024] A first embodiment of the invention will herein- 
after be described with reference to Figures 1 -3. 
[0025] Figure 1 shows an example network on which 
the invention could be used. Such a network has nodes 
2 which can act to route data over more than one net- 



work link 4. hereinafter referred to as branch nodes. Da- 
ta can be received along at least one network link, and 
directed to other network links from a branch node 2. In 
a standard configuration, data could be received from 
5 or transmitted along any of the network links 4 connect- 
ed to the branch node. These branch nodes might, for 
example, be routers of types commonly used on packet 
switched networks. 

[0026] A spanning tree across the network is gener- 
10 ated from a distribution server (DS) 1 0 from which a soft- 
ware package 30 is to be distributed to a set of target 
nodes 14 which need to receive the software, with 
branching nodes 2 in the network acting as branches in 
the tree. Target nodes are represented with the letter "T" 
^5 in Figures 1 and 2. Target nodes might be leaves 16 of 
the tree or might be branching nodes 2 in the tree. Such 
a spanning tree to reach the targets is shown in Figure 2. 
[0027] The tree does not need to be explicitly defined, 
and could, for example, be implicit in routing tables on 
20 each of the branch nodes 2 which forward data as is well 
known in the art. That is to say. each of the branch nodes 
maintains a routing table giving the shortest route cur- 
rently available to a particular node. Such an architec- 
ture is well known, and there are well devetoped algo- 
us hthms for maintaining routing tables which will provide 
appropriate routing to generate spanning trees from any 
source to any plurality of destinations. Algorithms and 
protocols which are used vary depending on the network 
architecture being used at the section of the network in 
30 question. For example, there are specific algorithms and 
protocols used for bridge connected LANs, one of which 
is described in IEEE 801 . Id Such an architecture for use 
on Internet based systems is laid out in RFC 1716. which 
gives algorithms for efficiently routing Internet packages 
35 based on their IP addresses. The routing tables, or 
equivalent routing structures could be static, and updat- 
ed manually, or dynamic based on communication be- 
tween the nodes using a routing protocol such as the 
OSPF (Open Shortest Path First) Protocol. It should be 
40 noted that as long as the routing mechanism guarantees 
that the same route is always taken between any two 
nodes in a network, distribution from one node to a plu- 
rality of nodes will always occur in a spanning tree pat- 
tern. 

45 [0028] Each of the branching nodes in the tree needs 
to be provided with information regarding which nodes 
require particular simple packages 34 and composite 
packages 32 constituting an overall software package 
30 to be sent. An example of such a software package 

50 is shown in Figure 3A. According to this embodiment, 
this determination is made at the DS 10 and can be in- 
cluded in a transfer control file sent along with the soft- 
ware package 30. 

[0029] According to the first embodiment, when a 
5S package is installed on a target node 1 4, a record of that 
installation is kept, so that future transfers to that target 
node need to include only what is required This record 
might be kept at the target itself, and the target polled 
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by the DS 10 whenever new or updated software is to 
be installed, or it might be maintained by the DS so that 
it does not have to poll all the targets. A combination of 
these two methods is possible. 

[0030] The information required to determine which 
target nodes require which packages, can accordingly 
either be achieved by querying all the target nodes with 
regard to what packages are already installed thereon, 
or keeping track of the packages installed on each node 
as they are installed. If for any reason, the packages 
installed on a particular target node cannot be ascer- 
tained, the entire software package 30 can be sent to 
that target node. 

[0031] Certain of the branch nodes 2 operate to 
repackage data in a software package 30 as will become 
apparent. Such branch nodes are referred to hereinafter 
as repackaging nodes 12. According to this embodi- 
ment, these nodes are provided with storage media ca- 
pable of storing an entire software package. However, 
embodiments are also envisaged in which the repack- 
aging required by this embodiment of the invention is 
carried out "on the fly" without requiring sufficient space 
to store the software package. 

[0032] It is important that each of the repackaging 
nodes 12 is provided with all of the target nodes 14 to 
which it will ultimately be transmitting the data, and the 
packages which are required by each of these target 
nodes. This can easily be achieved in many ways. Two 
examples are as follows: 

1 ) Data is transmitted with the software package 30 
from the DS 10 including a list of all the target nodes 
14 to which the package is to be transmitted. Any 
branch node 2 receiving the package, checks its 
routing table, or equivalent data store, to ascertain 
the shortest route to all of the nodes to which it is to 
transmit the package. It will send appropriate parts 
of the package along each of the appropriate 
branches to reach the targets. The branch node 2 
edits the list of target nodes it forwards along each 
branch to include only the target nodes to be 
reached along the respective branch. 

2) Each repackaging node 1 2 is provided with func- 
tionality to ascertain all targets for which It Is respon- 
sible for transmitting data originating from any par- 
ticular node. The list of target nodes does not need 
to be edited at each branch node,, as long as the 
repackaging node knows where the packet originat- 
ed. The list of target nodes could be stored in a sin- 
gle transfer control file as discussed above. 

[0033] The following example Illustrates how compos- 
ite packages are sent according to the first embodiment 
of the present invention. As shown in Figure 3, Compos- 
ite package A consists of references to simple packages 
B and C. The individual installation of package B is: 



1 ) Run B's pre-install script 

2) Install B's files 
5 3) Run B's post-install script 

The Individual installation of package C is: 

1 ) Run C's pre-install script 

10 

2) Install C's tiles 

3) Run C's post-install scnpt. 
IS However, the installation of composite package A is: 

1) Run A's pre-install script 
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2) Run B's pre-install script 

3) Install B's files 

4) Run B's post-install script 

5) Run C's pre-install script 

6) Install C's files 

7) Run C's post-install script 

8) Run A's post-install script 

[0034] If package B and package C have been indi- 
vidually installed on a target (in separate transfers) it can 
be inferred that package A is also installed, provided that 
package A Is not contingent upon its scripts. When a 
package is contingent upon its scripts, It means that the 
scripts must be executed for It to be considered effec- 
tively installed. 

[0035] For a given transfer, software package con- 
tents are gathered up at a DS and sent to target nodes 
along the hierarchy of nodes in the spanning tree. The 
DS sends the package to the first-level branch nodes in 
the tree, each of these sends it to the next level, and so 
on until the package has been sent to all the target 
nodes of the transfer The intermediate nodes used to 
forward the package may also be target nodes. 
[0036] Repackaging can occur whenever subsequent 
targets (those including and below the one being sent 
to) require only a portion of the contents of the package. 
The determination of what package contents are re- 
quired at each target takes place at the DS (and is In- 
cluded In the transfer control file) before the transfer be- 
gins. The set of package contents required by subse- 
quent targets defines the "new package". 
[0037] In the case of an open package, referenced 
packages can simply be selected and sent where re- 
quired, in the case of a closed package, the closed pack- 



4 



EP 1 014 652 A2 



8 



age and all packages it references, both directly and in- 
directly, must be sent. 

[0038] In the following example, shown in Figures 4A, 
4B and 4C, the package contains two composite pack- 
ages a and c. and three simple packages b. d and e. 
The intent is to install package a on targets P. Q, R, S 
and T. Package a is an open package, and package c 
is considered in both the open case in Figure 3B and 
the closed case shown in Figure 3C. 
[0039] In the case where package c Is open (Figure 
4B).* 

Target P has packages b and d installed and re- 
quires package e. 

Target Q has packages b and e installed and re- 
quires package d. Packages b and e are also sent 
for targets R, S and T. 

Target R has packages b and d Installed and re- 
quires package e. 

Target S has package c[de] installed and requires 
package b. 

Target T has packages d and e installed and re- 
quires package b. 

In the case where package c is closed (Figure 3C): 

Target P has packages b and d installed and re- 
quires package c[de]. 

Target Q has packages b and e installed and re- 
quires package c[de]. Package b is also sent for tar- 
gets S and T 

Target R has packages b and d installed and re- 
quires package c[de]. 

Target S has package c[de] installed and requires 
package b. 

Target T has packages d and e installed and re- 
quires packages b and c[de]. 
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[0040] The actual method of repackaging the packag- 
es constituting the software package 30 will vary con- 
siderably depending on the protocol and the architec- 
ture being used, but it can be seen that such repackag- 
ing would be very straightforward to implement usin g the 
knowledge from the transfer control file as to which 
packages are required. For example, a boolean array 
could be set up with a dimension corresponding to the 
different packages and a dimension corresponding to 
the different branches. All locations in this array are in- 
itially set to FALSE. A routine implemented on the 
repackaging node scans through the list of target nodes 
When it finds a node to which it will be transmitting along 
a certain link, it sets the array locations in the array as- 
sociated with that link in the first dimension and the 
packages required by that target node in the second di- 
mension to TRUE. Once all the target nodes have been 
considered by the routine, the contents of the software 
package required along each link will be defined by the 
locations in the array associated with that link which are 
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set to TRUE. Figure 5 shows an example of an array of 
this type for repackaging node Q in Figure 4C. 
[0041] In a second embodiment of the invention, each 
of the simple and composite packages which comprise 
an overall package do not need to be transmitted as a 
single contiguous package, but could each be transmit- 
ted separately with its own transfer control data to en- 
sure it is only transmitted to selected nodes. Each pack- 
age constituting the overall package would effectively 
then be multicasled to the appropriate nodes. If a pack- 
age is part of a closed composite package, the package 
could be provided with attachments instructing target 
nodes not to install the package until they receive the 
parent composite package. Using this alternative meth- 
od, repackaging would not have to be carried out, and 
similar overall bandwidth would be used. Indeed, a dif- 
ferent spanning tree could be used for each package 
which is being sent, depending on its target nodes. This 
is exemplified by Figs 6A and 6B. In Figure 6A, package 
B is sent over a first spanning tree over links 101. 103 
and 104. In Figure 6B. package C is sent over a second 
spanning tree over a different three links, 102, 105 and 
106. If the same spanning tree were to be used for both 
packages A and B. one package would have to be sent 
over 4 links. For example, using the tree of Figure 6A. 
package C would have to be transmitted over links 101 
102. 104 and 106. 

[0042] While preferred embodiments of the present 
invention have been illustrated and described, it will be 
understood by those of ordinary skill in the art that 
changes and modifications can be made without depar- 
ture from the invention in its broader aspects. Various 
features of the present invention are set forth in the fol- 
lowing claims. 



Claims 

1. Apparatus for hierarchical software distribution al- 
lowing distribution of a software package compris- 
ing at least a first and second package to a plurality 
of target nodes, said apparatus comprising: 

a distribution node for transmitting packages of 
software; 

a first branch node in communication with said 
distribution node, said first branch node being 
arranged to receive said software package 
from said distribution node; 

first and second target nodes, said first target 
node being in communication with said first 
branch node via a first network link, and said 
second target node being in communication 
with said first branch node via a second network 
link; said first package already being present 
on said second target node; 
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said first branch node being arranged to trans- 
mit said software package over said first net- 
work link, and said second package over only 
said second network link: 

whereby both the first and second packages 
are distributed to both said first and second tar- 
get nodes. 

2. Apparatus according to Claim 1 wherein said soft- 
ware package is sent as a contiguous package over 
said first network link. 

3. Apparatus according to Claim 1 wherein said first 
branch node is provided with information regarding 
which packages should be forwarded to which tar- 
get nodes. 

4. Apparatus according to Claim 1 wherein said first 
branch node is in communication with said first tar- 
get node via a second branch node, said second 
branch node being in communicatton with said first 
branch node via said first network link, and said sec- 
ond branch node being in communication with said 
first target node via a third network link; said second 
branch node being further in communication with a 
third target node via a fourth network link. 

5. Apparatus according to Claim 4 wherein each of 
said branch nodes is provided with Information re- 
garding target nodes to which each branch node is 
responsible for sending said packages and which 
of said first and second packages are required by 
said nodes: and wherein each branch node for- 
wards the information to subsequent nodes along 
each branch, editing said information for each 
branch to include only target nodes reached Via that 
branch. 

6. Apparatus according to Claim 4 wherein each of 
said branch nodes is provided with information re- 
garding the target nodes which require each of said 
first and second packages, and is further provided 
with information regarding which of said target 
nodes said branch node is responsible for forward- 
ing information to from said distribution node and 
which immediate branches the branch node uses 
to reach each of said target nodes for which it is 
responsible; whereby each branch node can ascer- 
tain which packages should be fonwarded along 
each immediate branch. 

7. Apparatus according to Claim 1 wherein said first 
package comprises at least two sub-packages and 
wherein installation of said two subpackages on 
each of said target nodes must be performed In a 
specified order; wherein 



10 

installation of one of said sub-packages has al- 
ready occurred on said first target node; and 
wherein 

both of said subpackages are distributed to said 
first target node and neither of the sub-packag- 
es are sent to said second target node. 

8. Apparatus for hierarchical software distribution al- 
lowing distribution of a software package compris- 
ing at least a first and second package to a plurality 
of target nodes, said apparatus comprising: 

a distribution node for transmitting packages of 
software; 

at least one branch node in communication with 
said distribution node; and 

first and second target nodes, said first target 
node being in communication with said branch 
node via a first network link, and said second 
target node being in communication with said 
branch node via a second network link; said first 
25 package already being present on said second 

target node; 
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each branch node being arranged to receive 
each of said first and second packages from 
said distribution node independently; 

said branch node being arranged to transmit 
said first package via said first and second net- 
work links to said first and second target nodes, 
and said second package via said second net- 
work link to said second target node; and 

said target nodes being arranged to install each 
package once the package is received; 

whereby the complete software package is in- 
stalled on each of said first and second target 
nodes. 

A method of distributing a software package to at 
least a first and second target node over at least 
one common network link, said software package 
comprising at least a first package and a second 
package, said first software package already being 
present on said second node, said method compris- 
ing the steps of 



sending said software package over said com- 
mon network link; thereafter sending only said 
55 second package to said second target node; 

and sending said software package to said first 
target node; 



6 



EP1 014 652 A2 

whereby both the first and second packages 
are distributed to both said first and second tar- 
get nodes. 
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FIG. 4B 
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Method and apparatus for software distribution 



(57) A software distribution system for software bun- 
dles comprising a plurality of packages, at least two of 
which are required by a different set of nodes on a net- 
work. Rather than send all of the packages constituting 
the software bundle to each node, only the packages in 



the software bundle which are required by nodes below 
a certain link in the network are sent along that link, 
whereby to minimize the bandwidth required for trans- 
mitting the software bundle. 
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